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Abstract — This paper proposes an analytical framework 
for peer-to-peer (P2P) networks and introduces schemes for 
building P2P networks to approach the minimum weighted 
average download time (WADT). In the considered P2P 
framework, the server, which has the information of all the 
download bandwidths and upload bandwidths of the peers, 
minimizes the weighted average download time by deter- 
mining the optimal transmission rate from the server to the 
peers and from the peers to the other peers. This paper 
first defines the static P2P network, the hierarchical P2P 
network and the strictly hierarchical P2P network. Any 
static P2P network can be decomposed into an equivalent 
network of sub-peers that is strictly hierarchical. This paper 
shows that convex optimization can minimize the WADT 
for P2P networks by equivalently minimizing the WADT 
for strictly hierarchical networks of sub-peers. This paper 
then gives an upper bound for minimizing WADT by con- 
structing a hierarchical P2P network, and lower bound by 
weakening the constraints of the convex problem. Both the 
upper bound and the lower bound are very tight. This pa- 
per also provides several suboptimal solutions for minimiz- 
ing the WADT for strictly hierarchical networks, in which 
peer selection algorithms and chunk selection algorithm can 
be locally designed. 

Index Terms — P2P network, weighted average download 
time, hierarchical P2P network, strictly hierarchical P2P 
network 



I. Introduction 

P2P applications (e.g, [1], [2], [3]) are increasingly pop- 
ular and represent a large majority of the traffic currently 
transmitted over the Internet. A unique feature of P2P 
networks is their flexible and distributed nature, where 
each peer can act as both server and client [1]. Hence, 
P2P networks provide a cost-effective and easily deploy- 
able framework for disseminating large files without relying 
on a centralized infrastructure [5]. These features of P2P 
networks have made them popular for a variety of broad- 
casting and file-distribution applications [5] [B] [7] [H] [5]. 
Specifically, chunk-based and data-driven P2P broadcast- 
ing systems such as CoolStreaming [6], Overcast [10] and 
Chainsaw have been developed, which adopt pull-based 
techniques [B], [7]- In these P2P systems, the peers possess 
several chunks and these chunks are shared by peers that 
are interested in the same content. An important prob- 
lem in such P2P systems is how to transmit the chunks to 
the various peers and create reliable and efficient connec- 
tions between peers. For this, various approaches have been 
proposed including tree-based and data-driven approaches 
(e.g. i [11] [12] [13] EH [IS] [16]). 

Besides these practical approaches, some research has 



begun to analyze P2P networks from a theoretic perspec- 
tive to quantify the achievable performance. The perfor- 
mance, scalability and robustness of P2P networks using 
network coding are studied in [T7] [TBj. In these inves- 
tigations, each peer in a P2P network randomly chooses 
several peers including the server as its parents, and also 
transmits to its children a random linear combination of 
all packets the peer has received. Network coding, working 
as a perfect chunk selection algorithm, guarantees every 
packet transmitted in a P2P network has new information 
for its receiver, which makes elegant theoretical analysis 
possible. Other research studies the steady-state behavior 
of P2P networks with homogenous peers using fluid models 
Uni [in] [S] ■ Most papers providing theoretical analysis for 
P2P networks assume dynamic systems with homogenous 
peers. 

This paper establishes an analytical framework for opti- 
mizing weighted average download time (WADT) for P2P 
networks with heterogeneous peers, i.e., peers with differ- 
ent download bandwidths and upload bandwidths. This 
paper focuses on static P2P networks with a single server 
and a fixed number of peers. In other words, no peer leaves 
or joins the P2P network. In the scheme of building the 
P2P network, the server collects all the download band- 
widths and upload bandwidths of the peers, and minimizes 
the WADT by determining the optimal transmission rate 
from the server or any peer to any other peer. A dynamic 
P2P system can also be modeled as a sequence of static 
P2P systems. Therefore, this study of static P2P networks 
is the first step to a complete analytical framework. We 
leave the study of dynamic P2P networks with heteroge- 
neous peers for future work. 

A static P2P network is a directed graph which has one 
root, the server, and at least one directed path from the 
root to each of the other nodes, which are peers. In a 
P2P network, peers are placed into levels according to the 
topological distances between these peers and the server. 
A peer is in level K if the length of the longest directed 
acyclic path from the server to the peer is K. A hierarchical 
P2P network is a P2P network in which each peer can 
only download from peers in the lower levels and upload to 
peers in the higher levels. Peers in the same level cannot 
download or upload to each other. A strictly hierarchical 
P2P network is a P2P network in which each peer in level 
K can only download from peers in level K — 1 and upload 
to peers in level K + 1. 

This paper shows that any static P2P network can be de- 



composed into an equivalent network of sub-peers that is 
strictly hierarchical. Therefore, convex optimization can 
minimize the WADT for P2P networks by equivalently 
minimizing the WADT for strictly hierarchical networks 
of sub-peers. This paper then gives an achievable upper 
bound for minimizing WADT by constructing a hierarchi- 
cal P2P network, and lower bound by weakening the con- 
straints of the convex problem. Both the upper bound and 
the lower bound are very tight. 

The strictly hierarchical P2P network is practical for pro- 
tocol design because peer selection algorithms and chunk 
selection algorithms can be locally designed level by level 
instead of globally designed. Minimizing the WADT for 
strictly hierarchical networks is a 0-1 convex optimization 
problem. However, if we have assigned all peers each to 
a level, then the global bandwidth allocation problem de- 
composes into local bandwidth allocation problems at each 
level, which have water-filling solutions. Several subop- 
timal peer assignment algorithms are provided and simu- 
lated. Some of these suboptimal but practical schemes can 
be used for content distribution systems, e.g. Overcast [10] . 

This paper is organized as follows. In Section [Til def- 
initions and notation for P2P networks are introduced. 
Section IIIII provides and discusses a taxonomy of over- 
lay networks. In Section IIVI the problem of minimiz- 
ing the weighted average download time is formulated and 
solved. Section [Vj presents the simulation results. Section 
IVll presents the conclusions. 

II. Setup and Problem Definition 

Consider a scenario where millions of peers would like to 
download content from a server in the Internet. The server 
has sufficient bandwidth to serve tens or hundreds of peers, 
but not millions. In the absence of IP multicast, one solu- 
tion is to form the server and the peers into a P2P overlay 
network and distribute the content using application layer 
multicast [T7| [22]. In this scenario, the content in the 
server is partitioned into chunks. Peers not only download 
chunks from the server and other peers but also upload to 
some other peers that are interested in the content. 

This paper focuses on content distribution applications 
(e.g, BitTorrent, Overcast [10]) in which peers are only 
interested in content at full fidelity, even if it means that 
the content does not become available to all peers at the 
same time. The key issue for these P2P applications is 
to minimize download times for peers. Since it usually 
takes several hours or days for a peer to download content 
in full fidelity, our work is less concerned with interactive 
response times and transmission delays in buffers and in 
the network. 

This paper studies a scheme to minimize the weighted 
average download time for a static P2P network. In this 
scheme, the server first collects the information of peers' 
weights, download bandwidths, and upload bandwidths 
and then performs a centralized optimization to find the 
best static P2P network, i.e., the optimal transmission 
rates of the transmission flows from the server or any peer 
to any other peer to minimize the weighted average down- 




Fig. 1 

The peer model 



load time. The server passes the optimal solution to the 
peers, and the peers build the connections according to 
the optimal solution. The rest of the paper will focus on 
the centralized optimization algorithm for determining the 
optimal static P2P network. 

In a static P2P network, the server with bandwidth S 
has a file, whose size is 1 unit for simplicity. There are N 
peers who want to share the file in the network. Each peer 
has download bandwidth di and upload bandwidth Ui, for 
i = 1,2,-- - ,N. These download bandwidths and upload 
bandwidths are usually determined at the application layer 
instead of the physical layer because an Internet user can 
have several downloading tasks and these tasks share the 
physical download and upload bandwidth of the user. 

It is reasonable to assume that di > Ui for each 1 < i < 
N. For the case of di < Ui for peer i, we just use the part of 
the upload bandwidth which is the same as the download 
bandwidth and leave the rest of the upload bandwidth. 

Denote the transmission rate from the server to peer j 
as rs^j and the transmission rate from peer i to peer j as 
ri—>j. The total download rate of peer j, denoted as r-,-, 
is the summation of rg^j and ri^j for all i =/= j. Since 
the total download rate is constrained by the download 
bandwidth, we have rj = rs—>j + ^Ylii-^j ^^^i — '^i ^'"''^ 
j — 1, - ■ ■ , N. Since the total upload rate is constrained by 
the upload bandwidth, we also have X^i^^j ''i^j — ^^r all 
j = 1, • • • , TV. One example of the peer model is shown in 
Fig. [TJ The download bandwidth and upload bandwidth of 
Peer 1 are di and ui respectively. Thus, the total download 
rate Vg^i -l-X]i=2 ''i^i l^ss than or equal to di. The total 
upload rate X]i=2 ^i^* l^^s than or equal to ui. 

Since our work is less concerned with interactive response 
times and transmission delays in buffers and in the network, 
the download time for each peer is dominated by 1 /r^ and 
the weighted average download time is '^i^iWi/ fi where 
Wi > is the weight of peer i. 

In a P2P network, if peer i forwards content to peer j, 
then peer z is a parent of peer j and peer j is a child of peer 
i. Each node can have several parents and several children. 
A primary goal of this paper is to minimize the WADT 
computed as '^iLiWi/ri. In the WADT computation 
could refer to the actual download rate r^"-* or the bud- 




Fig. 2 

A HIERARCHICAL P2P NETWORK WITH 4 PEERS 



geted download rate r\ . In general, rf"' < because 
the parents of a peer might not always have new content 
for sharing. The network coding strategy can be used as a 
perfect chunk selection algorithm to guarantee that a par- 
ent always has new content for its children [17j . This paper 
uses the network coding strategy as a chunk selection al- 
gorithm in P2P networks and so r|°'' = r^^"^ . Therefore, in 
the rest of the paper, we use for both budgeted download 
rate and actual download rate. 

III. Taxonomy of Overlay Networks 

This section studies graph structures of P2P overlay net- 
works. 

Definition 1: static P2P network: A static P2P net- 
work is a directed graph which has one root and at least 
one directed path from the root to each of the other nodes. 

The root node is the server which has the content to 
share. All the other nodes are the peers that are interested 
in the content. In a P2P network, peers are placed into 
levels according to the topological distances between these 
peers and the server. 

Definition 2: level of peer: A peer/node(use peer or 
node?) is in level K if the length of the longest directed 
acyclic path from the server to the peer is K. The server 
is defined in level 0. 

Definition 3: level of P2P netvi^ork: A P2P network 
has level K is the maximum level of peers is K. 

A. The hierarchical P2P network 

Definition 4: hierarchical P2P network: A hierarchi- 
cal P2P network is a P2P network in which each peer can 
only download from peers in the lower levels and upload to 
peers in the higher levels. 

Fig. [2] shows a hierarchical P2P network with 4 peers, 
in which peer i is in level i. Note that in this example, 
each peer downloads from all peers in the lower levels and 
uploads to all peers in the higher levels. However, by defi- 
nition, peers are not required to download from all peers in 
the lower levels or upload to/from? all peers in the higher 
levels. For example, the network shown in Fig. [2] will still 
be a hierarchical P2P network if the edge 1^4 vanishes. 
However, peers are not allowed to download from any peer 
in a higher level or upload to/from? any peer in a lower 



level. Also peers in the same level cannot download or 
upload to each other. 

Lemma 1: A hierarchical P2P network contains no di- 
rected cycle. 

Proof :{hy contraposition) Suppose there is a directed cy- 
cle ^ A2 ^ ••• ^ An — > Ai, then some Ai Ai+i 
or An — > Ai violates the requirement that peers in a hier- 
archical P2P network cannot upload to a peer in a lower 
level. Therefore, the network containing the directed cycle 
cannot be a hierarchical P2P network. Q.E.D. 

Lemma 2: A directed acyclic P2P network is a hierar- 
chical P2P network. 

Proof :{hy contraposition) Suppose a P2P network is not 
hierarchical, i.e., there exits a node A in level m and a 
node B in level n such that m > n and A B. Let 
S A\ ■ ■ ■ Am-i ^ ^ be the longest directed 
acyclic path from S to A and S ^ B\ ■ ■ ■ ^ Bn-\ — > B 
be the longest directed acyclic path from S to B. Since 
S ^ A\ ^ ■ ■ ■ ^ A,n-i A B is a directed path from 
S to B with length m + 1 > n, this path must contain a 
directed cycle, and hence, the P2P network is not a directed 
acyclic graph. Therefore, a directed acyclic P2P network 
is a hierarchical P2P network. Q.E.D. 

Theorem 1: The set of all hierarchical P2P networks is 
the set of all directed acyclic P2P networks. 

It is a direct consequence of Lemma [T] and Lemma [21 

B. The strictly hierarchical P2P network 

Definiation 5: strictly hierarchical P2P network: A 

strictly hierarchical P2P network is a P2P network in which 
each peer in level K can only download from peers in level 
K ~ 1 and upload to peers in level A' -|- 1. 

Fig. [3] shows a strictly hierarchical P2P network with 
3 levels. In a strictly hierarchical P2P network, peers in 
level K work together as a virtual server and upload to 
peers in level K -\- 1. In Fig. [31 peer 1 and 2 form the 
virtual server in level 1, denoted as Si. Peer 3, 4 and 5 
form the virtual server in level 2, denoted as 82- Since all 
transmission flows are between two consecutive levels, peer 
selection algorithms and chunk selection algorithms can be 
locally designed level by level. 

The relationships among the P2P network, the hierarchi- 
cal P2P network and the strictly hierarchical P2P network 
are concluded in Fig. [H 

C. Network of sub-peers 

A P2P overlay network is divisible if peers can be di- 
vided into sub-peers, and it is indivisible if peers cannot be 
divided. The division of peers can be performed at the ap- 
plication layer [4] [23]. Even for indivisible P2P network, 
peers can be conceptually divided into virtual sub-peers 
for theoretical analysis. A simple example of peer division 
is given in Fig. [5l Fig. [IJa) shows the original P2P net- 
work and Fig. [5jb) shows the network of sub-peers after 
the division of peer 1 into sub-peer lA and sub-peer IB. 

A peer division is equivalent if the transmission rates 
from the server to each peer and from each peer to each 



Fig. 3 

A STRICTLY HIERARCHICAL P2P NETWORK 



Fig. 5 
Peer division 




Fig. 6 

^ Construction of the network of sub-peers 

Conclusion on taxonomy of overlay networks 



other peer are invariant. The network of sub-peers is equiv- 
alent to the original P2P network if the peer division is 
equivalent. The peer division in Fig. [S] is equivalent if 
Ta^i = rs^iA, ri^2 = riA^2 and ra^i = rs^iB- 

Theorem 2: Any P2P network with N peers and K levels 
can be decomposed into an equivalent network of sub-peers 
that is strictly hierarchical, and has at most K levels, each 
of which contains at most N sub-peers. 

Proof of Theorem [H' In order to prove the theorem, it 
suffices to construct a strictly hierarchical network of sub- 
peers which is equivalent to the original P2P network. For 
any P2P network with N peers and K levels, denote the 
server S as node and the peers as node 1, 2, • • • , A''. Let 

Ki —{k : 3 a directed acyclic path from S 

to peer i with length k}, (1) 

and so \Ki\ < K is the cardinality of Ki. Divide peer i 
into \Ki\ sub-peers, which are denoted as sub-peer {i,k) 
for k E Ki. The sub-peer (i, k) is the part of peer i in level 
k. In the network of sub-peers, there is an edge from the 
server S to sub-peer («, 1) if and only if S" ^ z in the original 
network. We assign the transmission rate rs-.t(i,i) — rg^i- 
There is an edge from sub-peer (i, k) to sub-peer (j, fc-|- 1) if 
and only if there exits a directed acyclic path with length 
k + \ such that S ^ ■ ■ ■ i ^ j in the original P2P 



network. We assign the transmission rates level by level 
with 

^ X/ ''(i',fe-i)^(i,fc), (2) 

(i',fc-l)-»(i,fc) 

'''i^j ~ ^i^j ~ ^(i,m-l)^(j,m)i (3) 

r{t,k)~.(j,k+i) = min(r(,_fe) - ^ ^^+1), rf_,j-), (4) 

j'<j 

where ri^i^k) is the total download rate of the sub-peer (i, /c), 
ri—,j is the transmission rate from peer i to peer j in the 
original P2P network, r^^^ is the unassigned transmission 
rate from peer i to peer j until level k in the network of 
sub-peers, and r(j fc+i) is the transmission rate from 
(«, k) to (j, fc-l- 1). This division for the network in FiglSl^a) 
is shown in Fig. [S^a). 

Since \Ki\ < K for all i, the constructed network of 
sub-peers has at most K levels. There are at most N sub- 
peers in each level because each level contains at most one 
sub-peer of any peer. It is also easy to check that the 
constructed network of sub-peers is strictly hierarchical. 

In order to prove that the network of sub-peers is equiv- 
alent to the original P2P network, we need to show that 

k<K 



If i = 0; i.e., node i the server S, then 



(6) 



k<K 



If i 7^ 0, suppose for al\ k < K, 



< r 



k-l 



then 
K 



K-l 



fe=2 j'<i 



k=2 



and so 



3'<3 j'<j 

which is infeasible. Hence, there always exists some m < K 
such that 



(10) 



3'<] 



then = for k > m, ?'(i,fc-i)^(j.fe) = foi' ^ > "^1 ^-nd 
hence 



E - E ^i^,k-l)^U,k) 



k<K 



k<m 



(11) 



Finally, we need to show that the assigned transmission 
rates in the network of sub-peers are feasible, i.e., the down- 
load rate of each sub-peer is greater than or equal to it 
upload rate. Plugging j — N into (|4|), 



^{t,k)^{N,k+l) < ^(i,fc) 



E ^(i,fc)-0',fc+l)7 (12) 
j'<N 



and so 



'■ii,k) 



> 



E' 



^{i.k)-~*{j',k+i)- 



(13) 



Q.E.D. 

If edges with zero transmission rate and sub-peers with 
zero download bandwidth and zero upload bandwidth are 
allowed, divide peer i into K sub-peers, which are denoted 
as sub-peer (i, k) ior k — 1, ■ ■ ■ , K . The sub-peer (i, k) is 
the part of peer i in level k. There is an edge from the server 
S to sub-peer (i, 1) for all i. There is an edge from sub-peer 
{i, k) to sub-peer [j, A; -I- 1) for all i ^ j. The network of 
sub-peers after this division is also strictly hierarchical, and 
keeps all the transmission paths in the original network. 
This peer division for the P2P network in Fig.[Ha) is shown 
in Fig. [6l^b) , in which the dotted border sub-peers have zero 
download bandwidth and zero upload bandwidth, and the 
dotted edges have zero transmission rate. By Theorem [21 



there exits a transmission rate assignment such that the 
network of sub-peers after this peer division is equivalent 
to the original P2P network. Hence, minimizing the WADT 
for strictly hierarchical networks of sub-peers is equivalent 
to minimizing the WADT for P2P networks. 

In a very large P2P network with N peers, the length of 
the longest directed acyclic path K = max^ \Ki\ is usually 



much less than N. Denote dij,Ui 



as the download 



(7) 



(8) 
(9) 



bandwidth, upload bandwidth and the download rate of 
the part of peer i in level j, where dij > Uij and dij > 
rij. The sub-peers in level j work together as a virtual 
server to upload to the sub-peers in level j + 1. Denote 
Sj as the upload bandwidth of the virtual server in level j, 
^ ^ j ^ K , and so Sj = X^iLi niin(ui.j, J'ij). Let So — S. 

Lemma 6: Given the server with bandwidth S and the 
peers with download bandwidth di and upload bandwidth 
Ui, 1 < i < N , there exists an optimal strictly hierarchical 
network of sub-peers which achieves the minimum WADT 
such that Ti^j > Uij for all 1 < i < A^, I < j < K — 1. 



Proof: Suppose d. 



determine an optimal 
strictly hierarchical network of sub-peers which connects 
the server and the peers, and achieves the minimum 
WADT. It is clear that dij,Uij,fij is also an optimal 
strictly hierarchical network of sub-peers which has > 
Uij for any l<i<A^, I < j < K — 1, where 



(14) 
(15) 

(16) 



min(ui.j, ri_j) for I < j < K ^ 1 

■'3 = 



^ J2i=i ^i,3 for i ^ K 

iovl<j<K-l 



'^''^-^ d/-Ef=i'4j forj^/C 



The theorem is immediately proved. Q.E.D 

Corollary 1: Sj 
any 1 < j < K — 1. 



Corollary 1: Sj = Z^ili inin("ij , ''ij ) = Z^ili j for 



IV. Optimizing and Bounding Weighted Average 
Download Time 

A challenging problem for content distribution applica- 
tions is how to build a P2P content distribution network 
to achieve the minimum weighted average download time. 
The WADT of a P2P network with N peers is J2f=i m/n, 
where Wi is the weight and r.^ is the download rate for 
peer i. The weights Wi, i = 1, ■ ■ ■ ^ N , depend on the ap- 
plication. For broadcast applications such as CoolStream- 
ing [B] and Overcast [TU], in which all peers in the P2P 
network are interested in the same content, all weights of 
peers in the content distribution system can be set to be 1. 
For multicast applications such as "Tribler" jjl] , in which 
some helpers, who are not interested in any particular con- 
tent, store part of the content and share with other peers, 
weights of helpers are and weights of other peers are 1. 
In some applications, P2P systems can also partition peers 
into several classes and assign different weights to peers in 
different classes. 



A. Building Optimal P2P Networks Using Convex Pro- 
gramming 

Theoreni[2]deinonstrates that minimizing the WADT for 
P2P networks with N peers and K levels is equivalent to 
minimizing the WADT for strictly hierarchical networks of 
sub-peers with K levels, each of which contains N sub- 
peers. <C for large P2P networks. Recall that the 
bandwidth of the server is 5" = 5*0. The download band- 
width and upload bandwidth of peer i are di and ui respec- 
tively, where di > ui. Sj is the bandwidth of the virtual 
server in level j , 1 < j < K . dij, Ui^, ri^ are the download 
bandwidth, the upload bandwidth and the download rate 
of the part of peer i in level j. 

The constraints on the download bandwidths and upload 
bandwidths are dij > Uij for any I < i < N and 1 < j < 

-^j J2f=i = di and Ylf=i ^hj = ^'^^ 1 < * < 
N. The constraints on the download rates are rij < dij 
for any 1 < i < N and I < j < K. Lemma |6] shows 
that any optimal strictly hierarchical network of sub-peers 
has rij > Ui,j for any 1 < i < N and 1 < j < K — 
1. ^f^i rij = Ti is the total download rate for peer i. 
Corollary[l]demonstrates that the bandwidth of the virtual 
server in level j is Sj = X^il^i '^ij for j = 1, • • • , AT — 1. One 
has Sj-i > since the virtual server in Level j — 1 

uploads content to the sub-peers in Level j. 

All the constraints are linear and the objective function, 
Si^i'^i/^ii is a convex function. Therefore, the problem 
of minimizing the WADT is a convex optimization pro- 
gram, which can be solved by convex optimization solvers 
such as the interior-point method. The complexity of the 
interior-point method to solve this problem is o{\/K^N^) 
|25j . The solution to the convex optimization program 
provides the topology of the optimal P2P network which 
achieves the minimum WADT, however, the number of the 
peers is usually very large for most P2P applications and 
so the complexity for solving the convex program is too 
high. Fortunately, this convex program has an analytical 
achievable upper bound which is very tight to the optimal 
solution and its complexity is linear to the number of the 
peers in a P2P network. 

B. Analytical Upper Bound and Lower Bound for the Con- 
vex Optimization Program 

Theorem[T]shows that any hierarchical P2P network con- 
tains no directed cycle and vice versa. Constructing a hi- 
erarchical P2P network shown in Figl?] provides an achiev- 
able upper bound for the minimum WADT. Assume that 
S 3> maxi(Mi), which is always true practically. Without 
loss of generality, assume ui > U2 > •■■ > mat. Parti- 
tion the server into two parts 5^^^ — maxi(ui) — ui and 
5'(2) = S — ma,Xi{ui) ~ S — ui. The first part S^-^^ can serve 
all the peers with r^^^^ = Ui. The second part 5*^^^ can 
then freely serve for the peers only with the constraints 
di > rf ^ + rf ^ > u, and S^^'> > ^ Thus, the upper 




Fig. 7 

A HIERARCHICAL P2P NETWORK PROVIDING THE UPPER BOUND 

bound is the solution to the optimization problem 

N 

Minimize ^^Wi/r^ (17) 

1=1 

Subject to di > ri > Ui,i — 1,2, N, 

N 

y ri — Ui < S — max(ui). 

i=l 

The Karush-Kuhn- Tucker conditions for the optimal so- 
lution are 

Hi,Vi,\ > 0, 

- Wir~'^ + - + A = 0, 
m{ri - di) = 0, 
M'^i - = 0, 

A I ^(r-i - Ui) - (5 - max(ui)) J = 0. 

Combining the the Karush-Kuhn- Tucker conditions and 
the constraints in the problem, we obtain the upper bound 
which is X^il^i '^il'^'i with 

{^/wiR if Ui < y/mR < di 
Ui if Ui > ./wlR (18) 
di if ^/wlR > di 

where R is chosen appropriately such that X^i^i i^i ~ '"0 ~ 
S — maxi(ui). 

This upper bound is almost optimal. Consider the fol- 
lowing lower bound which is very close to the upper bound. 
Since S > — Ui) is a relaxed constraint, an lower 

bound is the solution to the optimization problem 

N 

Minimize Wj/Tj (19) 

i=l 

Subject to di > ri > Ui,i — 1,2, N, 

N 

n~Ui < S. 

i=l 



Note that this optimization problem (fTO|) differs from the 
problem P?|) only by replacing S with 5 — maxi(ui). Thus, 
the lower bound is Wi/ri with 

{.JwiR if Ui < ^/i«7i? < di 
Ui if > .JwlR (20) 
di if ,JwlR > 

where R is chosen appropriately such that ~ "0 ~ 

S. 

Since S 3> max(Mi), one has S S — maxi(ui). The 
upper bound and the lower bound are almost the same. 
Therefore, both of these two bounds are very tight, and the 
upper bound is almost an analytical solution to minimizing 
the WADT. Hence, it is sufficient to build a hierarchical 
P2P network to achieve the minimum WADT. 

C. Practical Solutions 

Compared with building networks of sub-peers and hi- 
erarchical P2P networks, it is more practical to build a 
strictly hierarchical P2P network. First, in a strictly hier- 
archical P2P network, peers do not need to be decomposed 
into sub-peers. Thus, it can have a much simpler protocol 
in network layer than a network of sub-peers. Second, in 
a strictly hierarchical P2P network, each level usually con- 
tains only tens or hundreds of peers. Therefore, one can 
locally design peer selection algorithms and chunk selection 
algorithms level by level, which only depend on a small col- 
lection of peers in the P2P network. The locally designed 
peer and chunk selection algorithm might be much simpler 
than the network coding strategy and other global designed 
peer and chunk selection algorithms. The following theo- 
rem shows that there exists a strictly hierarchical network 
of sub-peers which achieves the upper bound in the previ- 
ous subsection, and is very close to a strictly hierarchical 
P2P network. 

Theorem 3: There exist multiple upper-bound-achieving 
networks of sub-peers in which at most K — \ peers need 
to be decomposed into 2 sub-peers, where K <^ N is the 
number of levels. 

Proof: Equation (fT8|) gives r^, i = ,N, for the 

hierarchical P2P network which achieves the upper bound. 
Convert the hierarchical P2P network to multiple optimal 
strictly hierarchical networks of sub-peers by the following 
algorithm. 

This algorithm indicates that at most K — 1 peers need 
to be divided into 2 sub-peers and each level of the con- 
structed network of sub-peers contains at most 2 sub-peers. 
Since there are multiple choices in step (2), this algorithm 
provides multiple strictly hierarchical networks of sub-peers 
which are very close to a strictly hierarchical P2P network. 
Q.E.D. 

Another practical solution is to solve the problem of min- 
imizing the WADT directly for strictly hierarchical P2P 
networks. This problem can be formulated as a 0-1 con- 
vex optimization. The complexity to solve a 0-1 convex 
optimization problem is exponential to the problem size. 

Fortunately, this problem has an analytical suboptimal 
solution. Suppose there are N peers and K levels in a 



Algorithm 1 Peer Placement Algorithm 
1: Initialize level ^ = 1; 

2: Initialize the rest server bandwidth for the current level 

3: Initialize the set of the rest of the peers G = 

1,2,--- ,iV; 

4: while G is not empty do 
5: Choose any peer i from the set G; 
6: if s > ri then 
7: Put Peer i in the current level l; 

8: Set G = G\{i}; 

9: Set s = s — r,;; 

10: else 

11: Decompose Peer i into 2 sub-peers with r'"'^-' = s, 

(2) (1) • r (l)^ (2) 

mm{u^,rl 

12: Put the sub-peer (1) in level I and the sub-peer (2) 
in level I + 1; 

13: Set S — (X^Pocr j in level l^j) ~ i 

14: Set Z = / + 1; 
15: Set G = G\{i}; 
16: end if 

17: end while 



strictly hierarchical P2P network and the level location 
of each peer is already given. Then the global optimiza- 
tion problem can be decomposed into K local optimization 
problem and all of them have an analytical "Water Filling" 
solution. 

In particular, suppose the bandwidth of the server is 
S = Sq. The download bandwidth, upload bandwidth and 
the allocated download rate of Peer i are di , Ui and re- 
spectively. Sj , the bandwidth of the virtual server in Level 
j, is equal to the summation of the upload bandwidths of 
the peers in Level j. If Sj is greater or equal to the sum- 
mation of the upload bandwidth of all peers in level j + 1, 
then one has di > r, > u, and Sj+i = X^Peor » in level j 
If Sj is less than the summation of the upload bandwidth 
of all peers in level j -I- 1, one has < < and 
= Sj = X^Peer i in level 3 ^i- Therefore, no matter how 
to allocate download rates to peers in each level, the band- 
width of the virtual server in each level is fixed as long 
as the level positions of peers are fixed. Thus, the global 
minimum weighted average transmission rate problem can 
be decomposed to K local optimization problem. Each of 
them has a fixed server bandwidth and fixed number of 
peers in the level. Thus, if the virtual server Sj is greater 
or equal to the summation of the upload bandwidth of all 
peers in level j + 1, then the local optimization problem is 

Minimize Wi/ri 

i=l 

Subject to di > ri > Ui, i = 1, 2, • • • , iVj+i 
^ri< Sj 



where A'^+i is the number of peers in level j + I, for j ~ 
,K-1. 

By the Karush-Kuhn- Tucker conditions, the optimal so- 
lution is 

{y^R if Ui < y/w^R < di 
Ui if Ui > y/wlR (21) 
di if y/wiR > di 

where R is chosen appropriately such that Y^fj^i^ n = Sj. 

If the virtual server Sj for level j + 1 is less than the 
summation of the upload bandwidth of all peers in level 
j + 1, then the local optimization problem is 

Minimize Wi/ri 

i=l 

Subject to iti > > 0, z = 1, 2, • • • , Nj^i 

N, + i 
i=l 

where A'j+i is the number of peers in level j + I, for j =^ 
O,--- ,K-1. 

By the Karush-Kuhn- Tucker conditions, the optimal so- 
lution is 

J. = f y/wiR if < y^R < Ui .22) 
' \ Wi if Ui < y/wlR 

where R is chosen appropriately such that 'Yl!i=V ~ ^j- 
V. Results 

In this section, we simulate and evaluate the perfor- 
mances of the methods provided in Section IIVI Note that 
in this paper the download bandwidth d and the upload 
bandwidth u are decided at the application layer instead 
of the physical layer. Thus, the download bandwidth d can 
be continuously distributed in a large range such as 10 kbps 
to 100 Mbps. We assume that the download bandwidths 
di, i = 1, • • • ,N are independently identically distributed 
(i.i.d.) with uniform distribution over 2 — where /3 
is a very small positive number such as 10~^, 10~^. This 
distribution has the normalized mean value 1 and large 
maximum to minimum ratio (2-/3)/ /3. In practice, this 
parameter /3 can be determined as the minimum down- 
load bandwidth which is required by the P2P system. We 
also assume that the upload bandwidth Ui is uniformly dis- 
tributed over [adi,di], where < a < 1 is the minimum 
upload to download bandwidth required by the P2P sys- 
tem. 

A. A small network simulation 

In this small simulation, the bandwidth of the server is 
10 and the number of the peers in the network is A'^ = 
100. The download bandwidths di, i = I,-- - ,100, are 
i.i.d. with uniform distribution over [0.01,1.99], i.e., /3 = 
10^^. For each peer i, the upload bandwidth is uniformly 
distributed over [OAdi,di], i.e., a = 0.1. The weights for 



all the peers are equal and normalized to be 1/A^ — 1/100, 
and so J2t = 1. Thus, the WADT is We wiU 

compare 7 methods in this experiment. They are 

• 1. Optimal solution to the convex optimization program 
with level = 30. The convex program solver is CVX 

m- 

« 2. The upper bound (|17p by constructing a hierarchical 
P2P network. 

• 3. The lower bound by relaxing the constraints. 

• 4. The suboptimal solution to the 0-1 convex optimiza- 
tion problem for strictly hierarchical P2P networks. The 
number of the levels are set iohe K — 30 and we randomly 
put around N/K peers in each level. 

• 5. The suboptimal solution to the 0-1 convex optimiza- 
tion problem for strictly hierarchical P2P networks. We 
convert the upper-bound-achieving hierarchical P2P net- 
work (Method 2) to a network of sub-peers by Algorithm 
[TJ The peers are placed into levels according to the con- 
structed network of sub-peers, which is very close to a 
strictly hierarchical P2P network. 

« 6 This is a trivial upper bound such that the rate of each 
peer is the same as the upload bandwidth. 

• 7 This is a trivial lower bound such that the rate of each 
peer is the same as the download bandwidth. 

The distribution of the weighted average download times 
of 500 experiments are shown in Fig. [HI The distribution of 
the difference and the relative difference between Method 
3 and other methods are shown in Fig. [H] and Fig.[Tni The 
weighted average download times of Method 1 is concen- 
trated in the range of [1.7,4.1] with mean value 2.877. 
Method 2 and Method 3 provide the upper bound and 
lower bound for the minimum WADT. Fig. [51 [HI and [TOj 
show that these two bounds are very tight. In this case, the 
lower bound is almost always the same as the optimal solu- 
tion since the distribution curves of Method 1 and Method 
3 perfectly match. The distribution curve of Method 2 is 
slightly different from that of Method 1 , which means that 
the upper bound is only slightly larger than the optimal 
solution for most of the experiments. In this simulation, 
the bandwidth of the server is 10, which is not much larger 
than the maximum of the upload bandwidths. That is why 
the upper bound is still slightly different from the optimal 
solution. The distribution of Method 5 shows that in most 
of the experiments, the WADT of Method 5 is close to the 
optimal solution, however, it is sometimes much larger than 
the optimal solution. This is because the peer placement 
in Method 5 is usually very good but sometimes bad. Note 
that it is NP complete complex to find the best peer place- 
ment. In order to improve Method 5 without increasing the 
complexity, we need provide a better but still low-complex 
peer placement algorithm. The performance of Method 4 
is much worse than the performance of Method 5 because 
the peer placement algorithm is Method 4 is much worse 
that in Method 5. 

The bandwidth usage is the ratio of the total transmis- 
sion rate and the total download bandwidth of all the peers 
in the network. For a fixed WADT, it is clear that the lower 
bandwidth usage the better. However, it is a trade off to 
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Distribution of the WADT for different methods. 
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Distribution of the relative difference of the WADT 
BETWEEN Method 3 and other methods. 
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Distribution of the differences of the WADT between 
Method 3 and other methods. 
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Distribution of the bandwidth usage for different methods. 



decrease the bandwidth usage and to decrease the WADT, 
which is also verified in Fig. [11] Fig. [Tl] shows the distri- 
bution of the bandwidth usage of different methods. The 
distribution of Method 1, the optimal solution, is exactly 
the same as the distribution of Method 3, the lower bound. 
The bandwidth usage of Method 2, the upper bound, is 
slightly less than that of Method 1 and the bandwidth us- 
age of Method 5 is slightly less than that of Method 2. It 
is verified that the method with smaller WADT always has 
higher bandwidth usage. The mean values of the weighted 
average download times and the bandwidth usage of differ- 
ent methods are listed in Table HI 

For one typical experiment, the plots of the virtual server 
bandwidths versus the levels for different methods are 
shown in Fig. [121 For Method 1, the number of the levels 
K is manually chosen. In this experiment, if is 30. One can 



TABLE I 

The mean values of the weighted average download times 
(W.A.D.T.), the normalized weighted average download 
times (N.W.A.D.T.)and the bandwidth usage (B.U.)of 
different methods for experiment a. 



Method 


W.A.D.T. 


N.W.A.D.T. 


B.U. 


7 


2.468 


0.854 


1.000 


3 


2.877 


1.000 


0.650 


1 


2.877 


1.000 


0.650 


2 


2.947 


1.025 


0.633 


5 


3.461 


1.200 


0.621 


4 


5.338 


1.849 


0.471 


6 


6.361 


2.194 


0.551 
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Fig. 12 

Bandwidth of virtual servers for different methods. 



see that the bandwidths of the virtual servers from level 5 
to level 30 are linearly decreasing to for Method 1. The 
bandwidth of the virtual server generated by the last level 
is almost 0. In other words, all the upload bandwidths of 
the server and the peers are fully used. This is the reason 
why the performance of Method 3, the tight lower bound, 
is almost the same as that of Method 1. For Method 2, the 
number of the levels needed is automatically solved. It is 
15 in this experiment. 

B. A large network simulation 

In this simulation, the bandwidth of the server is 50 and 
there are 4000 peers in the network. The download band- 
widths di, i = 1, ■ ■ ■ ,4000, are i.i.d. with uniform distri- 
bution over [0.01,1.99]. The upload bandwidth Ui is uni- 
formly distributed over [O.ldi, c?i]. The weights for all the 
peers are equal and normalized to be 1/A^ = 1/4000, and 
so = 1- 

The distribution of the weighted average download times 
of 800 experiments are shown in Fig. [131 The distribu- 
tion of the difference and the relative difference between 
Method 3 and other methods are shown in Fig. [T3] and 
Fig. [151 These figures show that the lower bound and the 
upper bound provided by Method 3 and Method 2 are very 
close. The performance of Method 1 should be between 
these two bounds, although we don't simulate Method 1 in 
this case. In this simulation, the bandwidth of the server 
is 50, which is much larger than the maximum of the up- 
load bandwidths. That is why S S — maxi(wi) and the 
upper bound is almost the same as the lower bound. The 
performance of Method 5 is worse than that of Method 2 
but still a lot better that of Method 6 and 7. 

The bandwidth usages of these methods are shown in 
Fig. [161 The distribution of Method 2 is almost the same 
as the distribution of Method 3, which verifies the almost 
same performance of Method 2 and 3. The mean values of 



0.45 
0.4 










Melbod 7: irivial lowe 
^elhod 2: light upper 
Method 3: Light iower 


bojnd 




11 






Method 5: sLncliy hier 
Method 4: slricliy hier 
Method 6: Irtviai uppe 


rchical with suboptimal placements 
rchical with random placements 


0.35 
0.3 














0.25 














0.2 
















0.15 
















0.1 






/ 










0.05 


























'eighted average download lir 



Fig. 13 

Distribution of the WADT for different methods. 
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Distribution of the differences of the WADT between 
Method 3 and other methods. 



the weighted average download times and the bandwidth 
usage of different methods are listed in Table[ni Combining 
the simulation results, the comparison of these 7 methods 
in different criteria are listed in Table [ml 

VI. Conclusions 

This paper proposes an analytical framework for peer-to- 
peer (P2P) networks and introduces schemes for building 
P2P networks to approach the minimum weighted average 
download time (WADT). In the considered P2P framework, 
the server, which has the information of all the download 
bandwidths and upload bandwidths of the peers, minimizes 
the weighted average download time by determining the 
optimal transmission rate from the server to the peers and 
from the peers to the other peers. 

This paper first defines the static P2P network, the hi- 
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Distribution of the relative difference of the WADT 
BETWEEN Method 3 and other methods. 
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Distribution of the bandwidth usage for different methods. 



TABLE II 

The mean values of the weighted average download times 
(W.A.D.T.), THE normalized weighted average download 
times (N.W.A.D.T.)and the bandwidth usage (B.U.)of 
different methods for experiment B. 



Method 


W.A.D.T. 


N.W.A.D.T. 


B.U. 


7 


2.659 


0.6898 


1.000 


3 


3.854 


1.000 


0.562 


1 








2 


3.870 


1.041 


0.562 


5 


4.666 


1.210 


0.552 


4 


6.709 


1.740 


0.438 


6 


6.804 


1.765 


0.550 



TABLE III 

Comparison of the methods in the criteria of the weighted 
average download time (w.a.d.t.), the bandwidth usage 

(b.u.), the complexity of the computation in the server (s. 
comp.) and the complexity of the algorithms in peers (p. 

COMP.) 



Method 


W.A.D.T. 


B.U. 


S. Comp. 


P. Comp. 


1 


Optimal 


Highest 




High 


2 


Almost Opt. 


Very High 


0{N) 


Low 


5 


Good 


High 


OiN) 


Low 


4 


Bad 


Low 


0{N) 


Low 


6 


Bad 


Low 


OiN) 


Low 



erarchical P2P network and the strictly hierarchical P2P 
network and studies the graph structures of these P2P net- 
works. The main result is that any static P2P network 
can be decomposed into an equivalent network of sub-peers 
that is strictly hierarchical. Therefore, convex optimization 
can minimize the WADT for P2P networks by equivalently 
minimizing the WADT for strictly hierarchical networks 
of sub-peers. This paper then gives an achievable upper 
bound for minimizing WADT by constructing a hierarchi- 
cal P2P network, and lower bound by weakening the con- 
straints of the convex problem. Both the upper bound and 
the lower bound are very tight. 

The strictly hierarchical P2P network is practical for pro- 
tocol design because peer selection algorithms and chunk 
selection algorithms can be locally designed level by level 
instead of globally designed. Minimizing the WADT for 
strictly hierarchical networks is a 0-1 convex optimization 
problem. However, if we have assigned all peers each to 
a level, then the global bandwidth allocation problem de- 
composes into local bandwidth allocation problems at each 
level, which have water-filling solutions. Several subop- 
timai peer assignment algorithms are provided and simu- 
lated. 
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